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Abstract 

In probability theory, there is a tendency to treat one random variable with a given 
distribution as being just as good as any other. By and large this is fine because 
probability is (mostly) concerned with distributional properties of random variables. 
However, every now and again we are forced to deal with non-distributional properties. 
In this paper we investigate how different random variables with the same distribution 
can be. Specifically, we construct random variables that are all uniformly distributed 
on the unit interval, but that nonetheless have strikingly different properties. 

In probability theory, there is a tendency to treat one random variable with a given 
distribution as being just as good as any other. By and large this is fine because probability 
is (mostly) concerned with distributional properties of random variables. However, every 
now and again we are forced to deal with non-distributional properties. For example, when 
working with Levy processes, one often works with a cadlag version of the process even 
though the property of having cadlag paths is not distributional. Or, when working with 
Markov processes, one often works with the canonical version of the process on path space 
even though this is only one particular instance of the process. These examples highlight a 
natural question: given two random variables with the same distribution, how different can 
they really be? 

In this note, we will mostly restrict ourselves to considering random variables that are 
uniformly distributed on [0,1]. Throughout this note we let B[0, 1] be the Borel f2-algebra on 
[0, 1] and let A be Lebesgue measure on [0, 1] restricted to Borel sets. We will let (fi, P, J 7 ) 
denote an arbitrary, but not fixed, probability space. Recall that a measurable function 
X : Q [0, 1] is called uniformly distributed if for all A e B[0, 1] we have P(X~ 1 (yl)) = X(A). 
From this point on, we will adopt the usual probabilistic convention of using P(X e A) 
instead of P(X" 1 (A)). 
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Let us begin our investigation by looking at the following scenario. Suppose that X and 
Y are uniformly distributed random variables (not necessarily defined on the same basic 
probability space) and that h : [0, 1] — > R is a function such that h(X) and h(Y) are Borel 
measurable. Does it follow that h(X) =d h(Y); that is, are h(X) and h(Y) necessarily equal 
in distribution? When h itself is measurable the answer is clearly yes, but the case when h 
is nonmeasurable is less clear. We will show that the answer is no, h(X) and h(Y) need not 
be equal in distribution. For the sake of full disclosure, we remark that this answer depends 
on using the Axiom of Choice. Later on we will comment on what can happen if the Axiom 
of Choice is not assumed. 

Our first result is the following theorem. 

Theorem 1. Let // and v be two probability measures on R. There exist uniformly distributed 
random variables X : fli — > [0, 1] and Y : f2 2 — > [0, 1] and a function h : [0, 1] — > R such that 
h(X) is measurable and has law fi and h(Y) is measurable and has law v. 

In fact, we can do even better (or worse, depending on your perspective). 

Theorem 2. Let I be an index set with cardinality at most that of the continuum and let 
{^a}a£i be a collection of probability measures on R. There exists a family {X a } aeI of 
uniformly distributed random variables and a (single) function h : [0, 1] — > R such that, for 
all a, h(X a ) is measurable and has law fi a . 

Another way to look at the difference between two random variables is to look at the set 
of values they take. More precisely, suppose that X : fli — > [0, 1] and Y : f2 2 — > [0, 1] are 
uniformly distributed. What can we say about X(Qi) fl Y(Q 2 )7 Since A({x}) = for all 
x G [0, 1] there is no particular x G [0, 1] that must be in this intersection, but how small 
can it really be? Could it be empty? This last question can be phrased more dramatically: 
is there a uniform random variable X such that we can remove all of the values it obtains 
from [0, 1] and then build another uniform random variable that only takes values in what 
is left? Perhaps surprisingly, the answer is yes. We will prove the following strong version 
of the classical fact that a uniform random variable does not take any particular value. 

Theorem 3. There exist uniformly distributed random variables X : f^i — > [0, 1] and Y : 
n 2 -> [0, 1] such that X(Qi) n Y(Q 2 ) = 0. 

Again, we can prove a stronger version. 

Theorem 4. Let I be an index set with cardinality at most that of the continuum. There 
exists a family {X a } ae i of uniformly distributed random variables defined on probability 
spaces {fi a } ae ; such that X a (fl a ) fl XpiVLp) = whenever a ^ f3. 

We remark that Theorem H] is quite similar to [8, Example 7.30], which proves essentially 
the same result using normally distributed random variables instead of uniformly distributed 
ones. Both results are essentially probabilistic reformulations of a classical theorem from 
measure theory that we will come to in due course (for the impatient reader, it is Theorem 
[6] below). 

Let A* and A* be Lebesgue inner and outer measure respectively. We can also prove the 
following theorem. 
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Theorem 5. There exists a family {X Q } ae [ 0i i] of uniformly distributed random variables 
defined on probability spaces {fi a } Q £[o,i] such that whenever a ^ (3 we have 

A*(X a (ft Q ) n Xp(Q p )) = min(a, 0) and X*{X a {Q a ) n X^Vtp)) = max(a, (3). 

Theorems [I] and [2] can be obtained as consequences of Theorems [3] and H] respectively. 

Proof of Theorem Let {X a } a( zj of uniformly distributed random variables defined on prob- 
ability spaces {{l a } a( zi as in Theorem HJ For a G /, let F a be the cumulative distribution 
function for \x a and let F~ l be its right- continuous inverse. Define h : [0, 1] — > R by 

h(x) =^F-\x)i(x e x a (n Q )). 

Note that for each 16 [0, 1] at most one term in the sum is non-zero, so there is no need to 
worry about the fact that it is summing over a large index set. □ 

It remains to prove Theorem 0J The first step is to figure out exactly which subsets 
of [0, 1] can be the range of a uniform random variable. Suppose that X : Q — > [0, 1] is 
uniformly distributed and consider X(fl). Observe that X(Q) must intersect every set of 
positive measure since for all A G B[0, 1] we have P(X G A) = P[X G (A n X(Q))]. Thus 
X*(X(Q)) = 1. In fact, this condition is also sufficient for a subset of [0, 1] to be the range 
of a uniform random variable. Precisely, 

Proposition 1. A subset of [0, 1] is the range of some uniform random variable if and only 
if it has outer measure 1 . 

Proof. From our discussion above, it remains to show that if A C [0, 1] has outer measure 
1 then there is a uniform random variable with range A. We will construct such a random 
variable explicitly. Let Ba[0, 1] = {C fl A : C G B[0, 1]}. It is not hard to check that Ba is a 
ex-algebra on A. 

Claim. IfB,Ce B[0, 1] and B n A = C n A then X(B) = A(C) . 

Proo/ o/ claim. In this case we have AnBnC = AnB = AnC, so that B\(BnC) C [0, 1]\A 
Since A*(A) = 1 this implies that A(5 \(Bn C)) = 0, so that A(5) = A(fi n C). Similarly, 
A(C) = A(BflC), which proves the claim. □ 

We define A^ : B A [0,1] -)■ [0,1] by A A (C ni) = A(C). The claim shows that A A is 
well defined. It is straightforward to check that Xa is a probability measure on (A, jBa[0, 1]). 
Define X : (A, X A , B A [0, 1]) -> ([0, 1], B[0, 1]) by X(x) = ar. It follows immediately that X is 
uniformly distributed and has range equal to A □ 

Proposition [1] reduces the proof of Theorem H] to finding continuumly many pairwise 
disjoint subsets of [0, 1] each of which has outer measure 1. Fortunately for us, this is a 
classical result from measure theory due to Lusin and Sierpinksi, 
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Theorem 6 (Lusin and Sierpinski |5J). There exists a partition of [0,1] into continuumly 
many pairwise disjoint sets, each of which has outer measure 1. 

The original reference for this Theorem is [5], and this paper has recently been reprinted 
in [6j p. 26]. See also [TJ for a slightly more general version where [0, 1] is replaced by an 
arbitrary closed set and the condition of having outer measure 1 is replaced by having full 
outer measure. 

Something like the Axiom of Choice is needed to prove Theorem [61 To see this, let 
(A a ) a& j be a partition of [0,1] into continuumly many sets such that X*(A a ) = 1 for all 
a £ I. Observe that for each a, A a must be non-Lebesgue-measurable since we also have 
A*([0, 1] \ A a ) — 1. The existence of non-measurable sets is connected, to some degree, with 
the Axiom of Choice. For example, there exists a model of the Zermelo-Frankel axioms of set 
theory (without Choice) in which the set of real numbers is a countable union of countable 
sets (see [3j Theorem 10.6]). In such a model, all sets are Borel, so nothing like what we have 
been discussing here can happen. Of course, it is also near impossible to do any analysis in 
such a model. Even if we allow enough Choice to do most analysis, things can go wrong. For 
instance (under some additional assumptions) there models of the Zermelo-Frankel axioms 
with the Axiom of Dependent Choice in which every set of reals is Lebesgue measurable (see 
HHU for details). 

Proofs of Theorems^ and\^ For Theorem HI let (A a ) a& i be a partition of [0, 1] as in Theo- 
rem [6] and for each a E I let X a be a uniform random variable with range A a , as guaranteed 
to exist by Proposition [TJ For Theorem[5], let (A*)ae[o,i] be a partition of [0, 1] as in Theorem 
[6j Define B a = A a U [0, a] and each a G [0, 1] let X a be a uniform random variable with 
range B a , again as guaranteed to exist by Proposition [TJ □ 

For the proof of Theorem [6] we refer the reader to the references mentioned above. We 
content ourselves to sketch the proof of the following weaker theorem, 

Theorem 7. There exists a partition of [0, 1] into two sets, each of which has outer measure 
1. 

For us, the advantage of proving Theorem [7] instead of Theorem [6] is that we can prove 
it using a minor variation of the standard construction of a non-measurable set. We follow 
the approach in [2J Section 16], to which we refer for a more detailed version of the proof. 

Proof. Let ( be an irrational number and let B = {n + m( : n,m G Z}. Further define 
B e = {n + m( G A : n is even} and B = B \ B e . It is not difficult to see that B, B e , and 
B D are each dense subsets of 1R. We define an equivalence relation on M by x ~ y if and only 
if x — y G B. Using the Axiom of Choice, we let C C R be a set containing exactly one 
representative of each equivalence class in M/ ~. Let M = C + B e . Note that the difference 
set D(M) := {mi -m2 : mi,ffi2 G M} is disjoint from B a . However, the difference set of any 
measurable set with positive Lebesgue measue contains an open interval about the origin. 
Since B Q is dense, it follows that M has no subsets of positive measure. Furthermore, we see 
that IR \ M = M + 1, so the complement of M also contains no subset of positive measure. 
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It follows that if we define A 1 = M n [0, 1] and A 2 = [0, 1] \ M then {A 1 , A 2 } is a partition 
of [0, 1] into two sets, each of which has outer measure 1. □ 



References 

[1] Alexander Abian. Partition of nondenumerable closed sets of reals. Czechoslovak Math. 
J., 26(101)(2):207-210, 1976. 

[2] Paul R. Halmos. Measure theory. Springer- Verlag, New York, 1974. 

[3] Thomas J. Jech. The axiom of choice. North-Holland Publishing Co., Amsterdam, 1973. 
Studies in Logic and the Foundations of Mathematics, Vol. 75. 

[4] Saharon Shelah. Can you take Solovay's inaccessible away? Israel J. Math., 48(l):l-47, 
1984. 

[5] W. Sierpinski and N. Lusin. Sur une decompostion d'un intervalle en une infinite non 
denumberable d'ensembles non measurables. Comptes Rendus de lAcademie des Sci- 
ences, pages 422-426, 1917. 

[6] Yakov Sinai, editor. Russian mathematicians in the 20th century. World Scientific Pub- 
lishing Co. Inc., River Edge, NJ, 2003. 

[7] Robert M. Solovay. A model of set-theory in which every set of reals is Lebesgue mea- 
surable. Ann. of Math. (2), 92:1-56, 1970. 

[8] Gary L. Wise and Eric B. Hall. Counterexamples in probability and real analysis. The 
Clarendon Press Oxford University Press, New York, 1993. 



5 



